States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning｜Neuron（2010） - 183Lab

States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning｜Neuron（2010）

Jan Gläscher, Nathaniel D. Daw, Peter Dayan, John P. O'Dohert

DOI: https://doi.org/10.1016/j.neuron.2010.04.016

強化学習(Reinforcement Learning; RL)

モデルフリー強化学習(Model-Free RL)

予測報酬誤差(Reward-Prediction Error; RPE)

腹側線条体(ventral striatum)

腹側被蓋野(ventral tegmental area; VTA)

モデルベース強化学習(Model-Based RL)

状態予測誤差(State Prediction Error)

前頭前皮質(prefrontal cortex; PFC)

頭頂間溝(Intraparietal sulcus; IPS)